Can ILP be Applied to Large Dataset?

نویسندگان

  • Hiroaki Watanabe
  • Stephen Muggleton
چکیده

There exist large data in science and business. Existing ILP systems cannot be applied effectively for data sets with 10000 data points. In this paper, we consider a technique which can be used to apply for more than 10000 data by simplifying it. Our approach is called Approximative Generalisation and can compress several data points into one example. In case that the original examples are mixture of positive and negative examples, the resulting example is ascribed in probability values representing proportion of positiveness. Our longer term aim is to apply on large Chess endgame database to allow well controlled evaluations of the technique. In this paper we start by choosing a simple game of Noughts and Crosses and we apply mini-max backup algorithm to obtain database of examples. These outcomes are compacted using our approach and empirical results show this has advantage both in accuracy and speed. In further work we hope to apply the approach to large database of both natural and artificial domains.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New ILP Model for Identical Parallel-Machine Scheduling with Family Setup Times Minimizing the Total Weighted Flow Time by a Genetic Algorithm

This paper presents a novel, integer-linear programming (ILP) model for an identical parallel-machine scheduling problem with family setup times that minimizes the total weighted flow time (TWFT). Some researchers have addressed parallel-machine scheduling problems in the literature over the last three decades. However, the existing studies have been limited to the research of independent jobs,...

متن کامل

شناسایی نوع و مدل وسیله نقلیه با استفاده از مجموعه بخش‌های متمایز‌کننده

In fine-grained recognition, the main category of object is well known and the goal is to determine the subcategory or fine-grained category. Vehicle make and model recognition (VMMR) is a fine-grained classification problem. It includes several challenges like the large number of classes, substantial inner-class and small inter-class distance. VMMR can be utilized when license plate numbers ca...

متن کامل

A new approach to pharmacophore mapping and QSAR analysis using

A key problem in QSAR is the selection of appropriate descriptors to form accurate regression equations for the compounds under study. Inductive Logic Programming (ILP) algorithms are a class of machine learning algorithm that have been successfully applied to a number of SAR problems. Unlike other QSAR methods, which use attributes to describe chemical structure, ILP uses relations. This gives...

متن کامل

Percutaneous laser photocoagulation of osteoid osteoma: Assessment of treatment in nine cases

ABSTRACT Background: Osteoid osteoma is a benign bony neoplasm and its classic treatment is surgery. In the r ecent decades percutaneous laser therapy was suggested to be replaced by surgery. In this study we have reviewed the results of the first applications of interstitial laser photocoagulation (ILP) for treatment of osteoid osteoma in Iranian patients. Materials and Methods: In this case s...

متن کامل

An ILP Solver for Multi-label MRFS with Connectivity Constraints

Integer Linear Programming (ILP) formulations of Markov random fields (MRFs) models with global connectivity priors were investigated previously in computer vision, e.g., [16, 17]. In these works, only Linear Programing (LP) relaxations [16, 17] or simplified versions [21] of the problem were solved. This paper investigates the ILP of multi-label MRF with exact connectivity priors via a branch-...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010